TROPICAL MIXTURES OF STAR TREE METRICS 



MARIA ANGELICA CUETO 

Abstract. We study tree metrics that can be realized as a mixture of 
two star tree metrics. We prove that the only trees admitting such a 
decomposition are the ones having only one internal edge and, moreover, 
certain relations among the weights assigned to all edges must hold. We 
also describe the fibers of the corresponding mixture map. In addition, 
we discuss the general framework of tropical secant varieties and we 
interpret our results within this setting. Finally, after discussing recent 
results on upper bounds on star tree ranks of metrics on n taxa, we show 
that analogous bounds for star tree metric ranks cannot exist. 



1. Introduction 

In the present paper we investigate tropical mixtures of two star tree 
metrics. Here star tree metrics on n taxa are star trees with n leaves labeled 1 
through n, equipped with positive weights on all its edges. Tropical mixture 
of two metrics D, D on n taxa are defined as {D®D)uj\ = raax{Duj\ , j)} 
for all 1 < i,j < n, that is point-wise maxima of metrics on n taxa. This 
study was inspired by the recent work of E. Matsen, E. Mossel and M. Steel 
([S] , [1])) as well as by some open questions regarding tropical secant varieties 
of linear spaces ([3], [4j.) 

Matsen and his collaborators consider phylogenetic mixtures (convex com- 
binations of site pattern frequencies) of two weighted trees of the same topo- 
logical type, and they characterize the conditions under which these give 
rise to weighted trees of a different topology. In particular, they pay special 
attention to mixtures of star tree metrics on four taxa. Their interest in 
studying these mixtures lies in a phenomenon called "branch repulsion" . If 
we mix together two weighted trees on four taxa with the same topology, 
with short internal branch lengths and with pendant edges alternating be- 
ing long and short, we are likely to get a tree with a different topology. 
Since tree topologies are characterized by the subtrees spanned by four of 
its leaves, the case of four taxa suffices to study trees on more taxa. Intu- 
itively, from the branch repulsion behavior we see that we can approximate 
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any tree topology we want by mixing together two star tree metrics. Thus 
the interest in understanding mixtures of star tree metrics. 

In our case, rather than being interested in phylogenetic mixtures we 
focus our attention on tropical mixtures of star tree metrics. Tropical mix- 
tures arise naturally in the context of Probability Theory as approximations 
of mixtures of multivariate Poisson distributions Ch. Ill]) and logdet 
transforms defined for Markov models on trees ( [131 Thm 8.4.3], [10].) 

The convex mixture maps of two star tree metrics studied by Matsen et 
al. correspond, in the language of Algebraic Geometry, to secant varieties of 
the space of star tree metrics. If we allow negative weights for the edges of 
our star trees, then mixtures of these objects correspond to secant varieties of 
the space of star trees. Applying standard techniques in Tropical Geometry, 
one can tropicalize the convex mixture map, to obtain a piecewise linear 
map: the tropical mixture map. Its image will be the tropical first secant 
of the tropicalization of the space of star trees, which is an n-dimensional 
linear space ([3].) 

However, when dealing with star tree metrics (i.e. phylogenetic star trees) 
we should consider tropical secants of polyhedral cones instead of tropical 
secants of ordinary linear spaces. The linear space, in this case, will be 
the linear span of the cone of star tree metrics. The extremal rays of this 
cone are the cut metrics d{ assigning di(i,j) = 1 for all j ^ i and where 
all other distances are zero. As one can imagine, the sign constraints for 
the scalars reflect the difference between star trees and star tree metrics: 
the sign constraint on the edge weights. While star trees admit negative 
weights for their edges, this is forbidden in the biological framework of star 
tree metrics. 

One interesting question that arises from these two settings involves star 
tree and star tree metric ranks of a symmetric matrix. By relaxing the 
positivity condition, one can show that every metric on n taxa can be written 
as the tropical mixture of n — 2 star trees ([2].) The analogous question for 
metrics and star tree metrics seems more delicate. Indeed, we prove that 
most cut metrics on n taxa (defined by partitions of the n taxa, assigning 
pairwise distance to pairs of taxon on the same subset, and distance 1 
otherwise) are not tropical mixtures of finitely many star tree metrics. This 
implies that upper bounds for star tree metric ranks cannot exist. 

The paper is organized as follows. In Section 2 we provide the basic 
definitions on distances over finite sets and tree metrics, and we present 
our main result: a complete characterization of the intersection of the space 
of tree metrics and the image of the mixture map for star trees, together 
with a description of its fibers (Theorem 12.81 ) All proofs are deferred to 
Section [3j Since log-limits of secant varieties of toric varieties correspond to 
tropical secant varieties of linear spaces, one should expect similar results 
between convex mixtures of star tree metrics and tropical mixtures of star 
tree metrics. Our characterization shows this connection. 
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In Section 0] we switch gears and we describe the general framework of 
tropical secants of linear spaces as treated in [3]. In the case of star trees 
with arbitrary weights, the associated linear space is spanned by {r^ = 
Sj<i e ji + Ylj>i e ijA<i< n }- In the spirit of [3], we can associate to this 

linear space a point configuration in Mw. In this case, this configuration 
is nothing else than the set of vertices of the second hypersimplex A(2,n). 
We refer to [HI Ch. 9], [5, Ch. 6 §3] for historical background and basic 
properties of this well-studied polytope. Tropical secants of star tree metrics 
will have the same underlying point configuration, but the crucial difference 
between these two settings will lie in the admissible regular subdivisions 
of A(2,n), which we call positive regular subdivisions. We show they are 
different even in the simplest case of trees on four taxa. The existence of 
metrics with infinite star tree metric rank implies that regular subdivisions 
of A(2, n) need not be positive. Thus, the problem of characterizing metrics 
with finite tree metric ranks cannot be approached by studying star tree 
ranks (see [HI p 124] for a conjecture on this topic.) We finish the paper 
with several open questions on tropical secant varieties of polyhedral cones, 
which provide the natural setting to investigate mixtures of tree metrics. 

2. Basic definitions and main result 

We begin by providing notation and basic definitions. We fix [n] = 
{l,...,n}. 

Definition 2.1. A tree on n taxa is an unrooted tree (i.e. a connected graph 
with no cycles) with leaves labeled by [n]. We let E(T) be the edges ofT. 

It is well-known that trees are completely determined by their list of 
quartets, i.e. subtrees spanned by four leaves. For example, quartet (ij\kl) 
represents the subtree in Figure CD Many authors have discussed this tree 
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Figure 1: Subtree determined by quartet (ij\kl). 

representation and the minimum number of quartets required to characterize 
the tree topology (pQ, [6], §5.4.2].) 

Given a tree T, we can assign nonnegative weights to all edges e € E(T) 
via a map w : E{T) — > R + . The pair (T,w) is called a weighted tree. 
Throughout this section all trees are weighted unrooted trees on n taxa. 
Given i,j € [n] and (T,w) a weighted tree, we define the distance between 
i and j to be dr(i,j) = Ylee(i-*j) w i e )-> where e varies along all edges in the 
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unique path from i to j. We omit the subscript T when understood from 
the context. 

Definition 2.2. A dissimilarity map is a symmetric matrix D € R™ x " such 
that Da = for all i £ [n]. Alternatively, whenever it is convenient for us 



we encode D by a vector in M^ 2 corresponding the upper diagonal portion 
of the original symmetric matrix. A dissimilarity map defines a metric if it 
satisfies the triangular inequality Dij < D{ k + D k j for all triples k. 

Dissimilarity maps will measure distances between pairs of taxa. 

Definition 2.3. A dissimilarity map D is a tree metric if there exists a 
weighted tree (T,w) such that D = dx- By abuse of notation we will some- 
times denote a tree metric by its associated weighted tree. 

From the definition, it is straightforward to check that the set of dissimi- 

(») 

larity maps is a closed pointed rational polyhedral cone isomorphic to Mjj. . 
Likewise, the set of metrics is a full dimensional closed pointed rational poly- 
hedral cone contained in the cone of dissimilarity maps. However, the space 
of tree metrics is a non-convex cone, but it can be decomposed into convex 
cones. Each piece is a closed rational polyhedral cone corresponding to a 
fixed tree topology. Its extremal rays are the compatible cut metrics that 
determine the topology of the tree T. All such cones will be isomorphic to 
the orthant Htf (T)l_n (pH Prop 2.37].) 

Tree metrics are characterized algebraically by the "Four Point Condi- 
tion" PH Thm 2.36]: 

Theorem 2.4. (Four Point Condition) A dissimilarity map D is a tree 
metric if and only if for any 4-tuple i,j, k, I € [n], the maximum among 



is attained at least twice. Moreover, if the previous holds D will be realized 
by a unique weighted tree. In particular, a tree metric D is realized by a star 
tree if and only if D {j + D M = D ik + D jt = D a + D jk . 

Now let us consider mixtures of dissimilarity maps: 

Definition 2.5. Let D, D be two dissimilarity maps. We define the tropical 
mixture of D and D as {D © D)ij = max{Djj, Dij} = Dij © Dij for all 
i,j £ [n]. This generalizes to mixtures of any number of dissimilarity maps 
in the natural way. 

To simplify notation, throughout the paper we refer to this construction 
as the mixture of D and D. From the definition it is straightforward to 
see that mixtures of metrics are metrics themselves. Since the Four Point 
condition with non distinct indices provides the metric condition, we only 
need to check tuples of four distinct indices. 

We define the mixture map as the function <\> : {tree metrics} 2 — ► {metrics} 



sending (D,D) i-> D®D. Since X(D © D) = (XD) © (XD) for A G R+, it 




{D i:j + D kl , D ik + Dji, D a + D jk } 
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(a) n — 4 (b) n = 5 

Figure 2: Topology for mixtures of two star trees on 4 and 5 taxa. 



follows that the image of (ft is also a pointed cone inside R^. The com- 
mutativity of the mixture map can be interpreted by a natural action of 
§2 on pairs of tree metrics. The mixture map will be constant on orbits of 
{tree metrics} 2 under this action. The description of a mixture in terms of 
its summands is in general not unique, even after considering this §2-action. 
We will see several examples of this behavior later on. 

From now on, we will focus our attention on the restriction of (ft to pairs 
of star trees 

(ft : {star trees} x {star trees} — > {metrics}. 

As one can show, the image of this restriction will also be a pointed cone. 
The main goal of this paper is to characterize the set im(ftn {tree metrics}. 
Along the way, we determine the fibers of (ft. Since D = D © D, we already 
know that {star trees} C im(ft. Thus, we need only study when a non- 
star tree metric belongs to the image of (ft. We now present the desired 
characterization. Our building blocks with be the cases of four and five 

tclXcl. 

Lemma 2.6. Assume n = 4 and let T be quartet (ij\kl) with edge weights 
as in Figure \2a[ Then T is a mixture of two star tree metrics if and only if 
ej, Bj > g or e^, e\ > g. Moreover for each pair of inequalities, the fiber over 
T consists of four pairwise disjoint 2- dimensional families of pairs (D,D), 
which correspond to 2- dimensional polygons inM.+ , together with their orbits 
under the §2- action. All orbits will have size 2. On the contrary, if both 
inequalities hold, the two collections of families are disjoint and thus the 
fibers consist of eight families of 2-dimensional polygons, together with their 
orbits. Moreover, there is a one-to-one correspondence between the two sets 
of four families, after conveniently relabeling the leaves of the resulting trees. 

Note that the characterization involves open conditions. If we do not 
require the inequalities to be strict, the fibers over boundary points will 
consist of degenerate star trees because some edges will have weight zero. 
The families describing the fibers will intersect at these pairs of degenerate 
star trees. 

It is interesting to compare the result of the previous lemma with the 
analogous one obtained in [8, Prop 2.2] for convex mixtures of two star trees 
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on four taxa. Tree metrics that are convex mixture of two star tree metrics 
are exactly the star trees and the quartets where the internal branch length 
is shorter than the sum of the branch lengths for any pair of non-adjacent 
edges. 

Lemma 2.7. Assume n = 5 and let T be a non-star weighted tree on five 
taxa. Denote by the weight of the edge attached to leaf i . Then, T is a 
mixture of two star tree metrics if and only if T has only one internal edge 
(labeled g), and ej,ej > g, where i,j are the leaves of the unique cherry 
attached to one endpoint of the internal edge (see Figure \2b\) Moreover, 
the fiber over such a tree T is the union of four disjoint §2- orbits of 2- 
dimensional polygons corresponding to the pairs (D,D). 

In particular, from the lemma we know that no trivalent tree metric is a 
mixture of two star tree metrics. The characterization for general n follows 
from the previous two lemmas. 

Theorem 2.8. Let T be a non-star tree metric. Then T is a mixture of two 
star tree metrics if and only if 

(i) n = A, T is quartet (ij\kl) and &i,ej > g or e^,e^ > g, where g 
denotes the weight of the internal edge (see Fiaure [3a]) ; 

(ii) n > 5, T has only one internal internal edge with weight g, and 
if I and J are a partition of n into labels of leaves attached to the 
left-hand side and right-hand side ofT, respectively, and we assume 
\I\ < \J\, we have two cases: 

(a) |/| = 2 and e^e^ > g (see Fiaurel3b^): 

(b) \I\ > 3, and ej > g for all i € [n] (see Fiaure[3c\). 

Moreover, for each one of these trees we can describe the fiber in terms of 
four or eight disjoint §2- orbits of 2- dimensional polygons corresponding to 
families of pairs (D,D). 

3. Proof of Lemmas 12.61 12.71 and of Theorem 12.81 

Before presenting the proofs of the two main lemmas, we need an algebraic 
characterization of the topology of three metrics that equal D © D in terms 
of the entries of D and D. Following the approach in [9], our description 
involves relative differences among the entries of both metrics. We denote 
by a,c,d,e the weights of the edges in D, whereas a,c,d,e are used for the 
edges in D as illustrated by Figure libl We label the middle edge of (12|34) 
by b (see Figure l4al) 

Since the topology of star trees is highly symmetric we need only treat 
the case of fibers of the mixture map over the quartet (12|34). The general 
case will follow by relabeling. As an aside remark, note that since the set 
im</> n {T : T = (12|34)} is a cone, we should expect our characterization to 
be invariant under multiplication by a positive scalar. Indeed, this will be 
the case. 
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(c)3<|/|=p<g=|J| 



Figure 3: Tree topologies of mixtures of two star trees. 
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(a) 




Figure 4: Labeling of star tree and quartet (12|34) for n = 4. 



From the labeling of edges in the star trees associated to D and D we 
obtain two 4x4 nonnegative symmetric matrices corresponding to these 
tree metrics. To simplify notation we only show the coefficients above the 
diagonal: 



D :-- 



/ a + d a + c a + e\ 

d+c d+e 

c + e 

V o / 



D 



/0 a + d a + c a + e \ 
d+c d+e 
c + e 



\ 



/ 
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Consider six new indeterminates s,t,x,y,u,w and the following equations 
relating the entries in D and D, which express the relative differences be- 
tween these two matrices. 



(1) 



a + c = 


a + c + s, 


d + e = 


d + e + t, 


a + e = 


a + e + x 


d + c = 


d + c + y, 


a + d = 


a + d + -u 


c + e = 


c + e + w 



Denote s + = max{s,0} and similarly for the other five new variables. 
Observe that max{L>i3, -D13} = D13 + s + . Thus T = D D will satisfy: 

{7i2 + T34 = a + d + u + + c + e + w + , 
7i 3 + T 24 = a + c + s + + d + e + t+ 
2l4 + T 23 = a + e + x + + d + c + y + . 

From dl]) and the fact that D and D are star trees, these new variables also 
satisfy two linear equations: s + t = x + y = u + w. The next result will be 
crucial in the proof of Lemma 12.61 

Proposition 3.1. With the above notation, T is the quartet (12|34) if and 
only if 

u + + w + < s + + t + = x + + y + . 

Proof. Follow immediately from the Four Point Condition. □ 

Proof of Lemma \2. 61 Our strategy goes as follows. We assume T = D D 
to be the quartet (12|34), and we find necessary and sufficient conditions 
on the weights of T for this to hold. Our unique restrictions are given 
by requiring all values of edge weights to be positive because we consider 
non-degenerate star tree metrics. We show that these conditions suffice to 
explicitly construct D and D from T. 

Since the mixture map is symmetric, without loss of generality we may 
assume s > (if not, switch the role of D and D.) Thus, we need to analyze 
two cases according to the sign of t. In what follows, we make full use of 
Proposition 13.11 

If t > 0, we need s + t = x + y and s + t = x + + y + . Thus x, y > 0. 
Likewise s + t = u + w but u + w < u + + w + < s + 1, a contradiction. Hence 
t < 0. Switching the roles of s and t we also conclude that they both are 
nonzero. 

By symmetry on the leaves of (12|34), to expect this topology for T we 
must have both pairs (s,t),(x,y) with coordinates of opposite sign. Hence, 
without loss of generality we may assume s > and t < 0. In addition, 
since x + y = s + t<s and s = x + + y + we must have either x = s,y = t 
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s. Assume the first conditions hold. By analyzing ([I]) we 

w w w - 

s = x>0,t = y<U =4> c = c+— -, e = e + — -, a = a + s — — -, a = d+y- 



or x = t, y 
conclude: 



w 
~2 ' 



Likewise, for the second conditions: 

u 



y > 0, t = x <0 



a 



a + -, d 



d+-,C- 



c + s- 



u 



e + x- 



u 

2' " " ' " 2' 

We now consider the sign patterns of u and u>. Assume u, w have opposite 
sign. Then from u + w = s + t we conclude u + + w + < s if and only if 
(0 < u < s and w = s + t — u<0)or(0<w<s and u = s + 1 — w < 0) . In 
case both u and u; have the same sign, we have no restrictions other than 
u + w = s + t. 

Let us compute the weights of the edges in T S (12|34), following the 
labeling of Figure [2a] with % = 1, j = 2, k = 3, 1 = 4: 



(3) 



9 



e2 := 



e3 



e 4 



-(T14 + r 23 

^(Ti4+T 13 

- (T24 + T23 

2(^13 + T23 
1 



(Ti2 + T 34 )) = ^( S -(n + + U ; + )), 



25 - T 34 ) 

25 - r 34 ) 

2g - T 12 ) 



a H 1 , 

2 2' 

d + T"2 + T' 

c H 1 , 

2 2 



(T14 + T 24 -2g- T 12 ) = e + 



x 



s 

2 + ~2 



We compute each indeterminate in (|3|) according to our two choices for 
pairing s, t, x, y together: 

• Case 1: s = x > 0, t = y < 0. We obtain: 

s u + s u + w + w + 

e i = G + 2+ ^-' e 2 = d~- + — , e 3 = c+ — , e 4 = e + — . 



Case 2: s = y>0,t = x<0. Likewise: 



ei 



1/ 



e2 



S 1£T 

cH 1 

2 2 



e-i 



2 + T 



We subdivide these two cases according to sing constrains: s > u > and 
w < (cases 1.1 and 2.1), s > w > and u < (cases 1.2 and 2.2) or u, w 
with the same sign (positive in cases 1.3 and 2.3 and negative in cases 1.4 
and 2.4). In this way we get expressions for all our indeterminates and we 
obtain necessary conditions they must satisfy. 

Remark 3.2. Notice that Cases 1 and 2 are symmetric. More precisely, they 
are equivalent if we change the labels for leaves 1 and 2 by 3 and 4 respectively 
and we also change the variables (a,d,u,w,x,y) by (c,e,w,u,y,x), while 
leaving s and t fixed. One needs to make these replacements between suitable 
pairs: 1.1 & 2.2, 1.2 & 2.1, 1.3 & 2.3 and 1.4 & 24 respectively. This 
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correspondence will be crucial for the proof of Lemma \2 L 
only need to study cases 1.1 through 1.4- 



Therefore, we 



Cases 1.1-1.4: 



a + s 



L , d 



d + u+ f 



s, c 



c + f, e 



e + 



2 ' 



Case 1.1: Suppose s > u > and w < 0, 



9 
ei 

C2 

e 4 



s— tt 
2 

a + 
d- 
c, 
e. 



S_ _i_ u 

2 "T" 2' 
s I u 

2 + 2 ' 



Since a > 0, u > the previous equations yield ej > 5. On the other hand, 
by imposing all variables a, d, e, c and to be positive we conclude some 
necessary restrictions among a, d, c, e, u, w, s and t. Namely, d > s — u — f = 
2g — f > 2g and c, e > — f > 0. Finally, e2 = d — g > g. Therefore, we 



conclude ei, e2 > g, d > s 



and c, e > 



Case 1.2: Suppose s > w > and u < 0, 



2 • 



9 


s—w 
~ 2 ' 


ei 


= a + f, 


e2 


— d - - 

2 ' 


e3 


= c+f, 


e 4 


= c + f 



In this case, from a > and w > we also have ei > <?. In a similar 
way to the previous case, from u < and < w < s we obtain additional 



necessary conditions: d > s 
e 2 = d- ^>g-u>g. 
Case 1.3: Suppose u, w > 0, 



f (because s 



— > 

2 ^ 



Again, 



f 


~~ 2 


tt") 


9 


ei 


= « + f 


+ 


< e 2 


= d -f 


+ 


e3 


= c+f 


5 


k e 4 


= c + f 





2 ' 
u 

2 ' 



As before, we have ei > 5, and we obtain the extra conditions d > s- 



a 2 , 



e 2 = d- f + f>s-u- 


w 
2 


S—U 

2 " 


= 5- 




Case 1.4: Supposes, w 


<0 












( 

9 




s 

2' 






ei 




a + f, 




< 


e2 




n! — 2. 
a 2 , 






e3 




c, 






k e 4 




e. 


Here we have e\ > g and 


we ^ 


;et c, e > - 


-- d > 

2 , u, ^ 



■ 1i ■ 



It 



111 
2 



2^2 



1£ 
2 ' 



thus e 2 = d — I > 



> 9- 
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Changing variables as stated in Remark 13.21 we get analogous results 
for cases 2.1 through 2.4. In particular, we obtain e3,e4 > g and a lower 
bound for e in terms of s,w and u. Hence, if (12|34) is a mixture of two 
non-degenerate star tree metrics we conclude ei, &2 > 9 or es, > g. 

For the converse, we assume that T is quartet (12|34) and it satisfies 
ei,e2 > g or e^^e^ > g. Our goal is to construct all possible pairs D, 
D giving T = D © D. From the "if direction" we know that such pairs 
(-D, D) come in four or eight two dimensional families, depending on which 
conditions on the weights of T hold. Moreover, we have a description of these 
families so we only need to check if we can find values for the indeterminates 
satisfying the sign restrictions. 

We claim that cases 1.1 through 1.4 are admissible if and only if e\ y e,% > g. 
We already know the "only if direction" from the previous discussion. We 
now show the converse holds: 

Case 1.1: Assume ei,e2 > g. We want to determine the values of a,c,d,e 
and a, c, d, e so that T = D ffi D. From the previous equations we easily 
obtain e = c = and d = ei + g. From the conditions e% > — ^ we 
see that > w > — 2e^, 2e^, although we still do not have an exact value for 
w. 

On the other hand, we have s = u + 2g. Hence, a = e± — = e\— g — u, 
a = e\ + g — ^, d = ei — g + ^f, c = + ^ and e = + y. The only 
conditions on u, w are determined by the positivity of all weights in D and 
D, namely, u, w are free parameters satisfying > uu > — 2min{e2 — g, e^, e^} 
and e\ — g > u > 0. The sign patterns for s, t will hold automatically. Hence 
our necessary and sufficient conditions are &\ , &i > g. 

Case 1.2; As in the previous case, assume ei,e2 > g. By the same proce- 
dure we get c = e3, e = e&, s = 2g+w, c = e^ — ^, e = e^ — a = ei—g — ^, 
d = e2 + g + ^,a = ei+g and d = e2 — g + u were u, w are two free param- 
eters satisfying > u > g — arid 2min{e3, e&, e\ — g} > w > 0. Therefore 
in this case our necessary and sufficient conditions are e\, > g as well. 
Case 1.3: Suppose e±, e% > g. From scratch we have e = and c = e^. We 
let u, w > be two free parameters, so c = e^ — ^, e = e^ — ^, s = 2g + u+w, 
a = e\ — g — u — ^, a = e± + g, d = e2 + g + ^ and d = e2 — g- 

In particular, the only restrictions over u, w are < w < 2 min{e3, e^, e\ — 
g} and 0<u<e\— g — ^. 

Case 1.4: Assume e±, ei > g. Then s = 2g, a = e± — g, d = ei + g, c = 
and e = 64. Our free parameters are u, w < 0. We get a = e± + g — j, 
e = e4 + ^, c = e3 + ^ and d = &2~ g + u+ ^. Therefore, u, w must satisfy 
> w > — 2 min{e3, e&, e2 — g} and 0>u>g — e2~ ^- 

By the symmetries mentioned in Remark 13.21 we have similar results for 
cases 2.1 through 2.4. The necessary and sufficient conditions are e3,e4 > 
g. We also obtain u, w as free parameters and their linear restrictions are 
translations of the restrictions from the corresponding previous four cases. 

Therefore, we have three scenarios for the fibers of the mixture map: 
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(i) ei, e 2 > g does not hold. Then e3, e4 > g and the fiber of the mix- 
ture map is the union of the four families described by cases 2.1 
through 2.4, and their orbits under the action of E> 2 (which inter- 
changes D and D.) All four families and their orbits are disjoint. 
By construction, each family is irreducible in the Zariski topology 
(affinely isomorphic to a product of intervals, or a triangle in M 2 .) 

(ii) e3,e4 > g does not hold. Then ei,e 2 > g and the fiber of the 
mixture map is the union of the families from cases 1.1 through 1.4, 
and their orbits. As before, these eight families are irreducible and 
disjoint. 

(iii) ei,e2,e3,e4 > g. Then the fiber consists of the orbits of all eight 
families given by cases 1.1 through 2.4. All orbits are disjoint. 

□ 

As a reference for the proof of Lemma 12.71 we include the description of 
all eight cases below. The free parameters are u, w. 

• Case 1.1: e% — g > u > and > w > — 2min{e2 — g, e 4 , 63}; 

f a := ei - g - u; d_:=e 2 + g; c := e 3 ; e := e 4 ; 

\a:= ei +g-f, d:=e 2 - + f; c := e 3 + f ; e := e 4 + f . 

• Case 1.2: > u > g — e 2 and 2 min{e3, e4, ei — g} > w > 0; 

|a:=ei-5f-f; (i:=e 2 + 5' + f; c := e 3 - f ; e := e 4 - f; 
\ a:=e\+g; d := e 2 - g + u; c := e 3 ; e := e 4 . 

• Case 1.3: ei — 5 — ^ > n > and 2 min{e3, e^, ei — g} > w > 0; 

( a := ei - 5 - it - f ; d:=e 2 +g + f; c:=e 3 -f; e := e 4 - f ; 
\a:=ei+g; d := e 2 - g; c := e 3 ; e := e 4 . 

• Case 1.4: 0>u>g — e 2 — ^ and > ui > — 2 min{e3, e4, e2 — <?}; 

(a:=ei-g; d_:=e 2 + g; c := e 3 ; e := e 4 ; 

\a := ei + 5 - f; d := e 2 - + u + f ; c:=e 3 + Tr; e := e 4 + f . 

• Case 2.1: 2 min{ei, e2, e3 — g} > u > and > w > g — e^; 

fa:=ei-|; d:=e 2 -|; c:=e 3 -5-|; e := e 4 + g + |; 
\ a:=ei; d := e 2 ; c := e 3 + g; e:=e4,-g + w. 

• Case 2.2: > u > —2 min{e4 — g, e\, e 2 } and 63 — g > w > 0; 

fa:=ei; ^ :=e 2; c := e 3 - 5 - e:=e 4 + 5; 

\a:=ei + f; d:=e 2 + |; c:=e 3 + 5-|; e:=e 4 -c/ + f. 

• Case 2.3: 2 min{e2, ei, e 3 — g} > w > and — g — |>w>0; 

fa :=ei-|; 4 :=e 2-|; c := e 3 - - § - w; e:=e 4 + g + f; 
\a:=ei; d := e 2 ; c := e 3 + g; e:=e 4 -g. 
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• Case 2.4: > u > — 2min{ei,e2, e± — g} and > w > (g — e$) — |; 

f a := ei; 4 :=e 2! c:=e 3 -g; e:=e 4 + g; 

\ a := ei + §; d := e 2 + |; c:=e 3 +g-§; e := e 4 - 5 + | + w. 

Remark 3.3. From the description of all eight cases one can show that, 
even when considering the commutativity of the mixture map, the sets cor- 
responding to each case are pairwise disjoint. Notice also that in all eight 
cases, one of the star trees has at least three of its branch weights independent 
of the values ofu,w. 

Proof of Lemma \2. 7| We will make full use of the results on four taxa by 
considering the quartets describing our non-star tree metric T. By symmetry 
on the taxa, we need only consider the two cases illustrated in Figure O We 
treat each case separately analyzing the possible quartets on these trees. 
The overall strategy consists of describing the fibers of the mixture map on 
four taxa and find compatible fibers over each quartet to build the original 
tree by gluing the partial liftings. 




2 4 2 4 



(a) (b) 
Figure 5: Non-star trees on five taxa, up to permutation of taxa. 

□ 

Proof for Figure \5ai Assume T as in Figure [5al As we said before, we have 
necessary and sufficient conditions for each quartet in T to be a mixture of 
two star tree metrics. We work with quartets (12|34), (12|35) and (12|45). 
Since the first two quartets involve leaves 1 and 3, we can always assume 
that s > 0. However, for the third quartet we need to consider both signs 
for s. 




4 2 4 



Figure 6: Edge weights in star tree and in tree T for n = 5. 

Assume T = D © D. Let g be the weight of the middle edge and ei the 
weight of the edge adjacent to leaf i, whereas we label by a, c, d, e, f the 
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weights on the star tree associated to D and similarly for D (see Figure El) 
Therefore, by considering the restrictions to the three mentioned quartets 
we see that the underlying tree T must satisfy the following conditions: 

ei,e 2 > g or e 3 ,e 4 ,e 5 > g. 

Since we have a complete characterization of mixtures on four taxa, we 
construct our candidates for the mixtures given each quartet. To finish, 
we glue our partial star trees together (whenever possible) to build our 
candidates D,D on 5 taxa. We show this can be achieved by exhaustive 
case by case analysis of the fibers over each quartet. 

Assume ei,e2 > g. Therefore, we know that at least we have to analyze 
cases 1.1 up to 1.4 for the three quartets (Recall that the order of D and D 
is determined by the sign of s). We also treat cases 2.1 through 2.4 in case 
we have e 3 , e 4 > g. 

Suppose (12|34) satisfies case 1.1. Then we have 

( a = ei-g-u, c = e 3 , d_=e 2 + g, e = e 4 , 

\a = e 1 + g-%, c = e 3 + f, d = e 2 -g + ?§, e = e 4 + f, 

where the free parameters are u,w s.t. > w > — 2min{e2 — g, e 4 ,e 3 } and 
< u < ei — g. 

For quartet (12 1 35) we have the same value of s = a + c — a — c > 0, so we 
do not need to switch the order of D and D. We claim that the fiber over 
( 12 1 35) can only correspond to case 1.1. Assuming cases 1.2 or 1.3 we see 
that e\ + g — ^ = a = e\ + g gives w = < 0, a contradiction. Likewise, 
for case 1.4 we have e\ + g — ^ = d = e\+g — \ implies w' = w and so 
^2 — 9 + ■f = d = e 2 — g + u' + gives u' = < 0, a contradiction. Cases 2.1 
and 2.3 cannot hold because they would imply e 3 + g = c = e 3 + ^ (w < 0). 
Likewise cases 2.2 and 2.4 cannot hold since they would yield e 2 = d = e 2 +g 
(g > 0.) Thus by joining the values from cases 1.1 for both quartets we get 
v! = u and w' = w with compatible values for all weights in D and D. 

To analyze quartet (12|45) we need to compute the difference (a + /) — 
(a + /), since its sign determines the order of D and D in expression (pQ). We 
obtain a + f = e\ + e^— g — u and a + / = &\ + es + g, thus a + f — a — f = 
2g + u > 0. Therefore, we do not need to switch the star trees in our 
computation of the edge lengths for this quartet. 

As before, there is only one possibility for quartet (12 145) : case 1.1 must 
hold. The proof is analogous to the one for the quartet (12|34) because we 
know that the order of the mixture is preserved. 

Therefore, if ex,e 2 > g and we are in case 1.1 for the quartet (12|34) we 
have that T = D © D where 

(a = ei - g - u, c = e 3 , d_=e 2 + g, e = e 4 , / = e 5 , 

\a = ei + g-%, c = e 3 + f, d = e 2 -g + %, e = e 4 + f, /=e 5 + f, 

and u, w are free parameters satisfying 0<u<ei— g, > w > — 2 min{e2 — 
g, e 4 ,e 3 ,e 5 }. 
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On the other hand, assume case 1.2 holds for (12 134) . Like we observed 
when studying the previous quartet, we know that case 1.2 must hold for 
(12 1 35) (compare the weight of a for cases 1.1, 1.4, 2.2 and 2.4, d for the 
case 1.3 and c for cases 2.1 and 2.3.) Therefore we get w' = w and u' = u 
where > u > g — e 2 and 2 min{e3, e 4 , e$, e\ — g} > w > on both quartets 
with compatible values for all weights. 

As before we need to compare a + / with a + f . We have a + f = 
a + f + (2g + w), and so we do not need to interchange D and D. Therefore, 
the only possibility for quartet (12 145) is case 1.2 which is compatible with 
the weights obtained before. Hence, T = D © D, where 

(a = e!-g-%, d_=e 2 +g+%, c = e 3 -f, e = e 4 - f_ = e 5 -%, 
\a = ei+g, d = e 2 -g + u, c = e 3 , e = e 4 , / = e 5 , 

and u, w are free parameters satisfying > u > g — e 2 and 2 minjei — 
9,e4,e 3 ,e 5 } > w > 0. 

Next, assume case 1.3 holds for (12|34). As before, the same case also 
holds for (12 135) and we get u = u', w = w' where 2min{ei — g, e±, e^, > 
w > and e.\ — g — ^ > u > and the values of all edge weights are 
compatible. In addition, a + f — a — f = 2g + (u + w) = s > 0, so D and D 
do not switch. Hence, case 1.3 must also hold for (12|45) and we conclude 
T = D @ D with 

(a = ei -g-u- f, d_= e 2 + g + f , c = e 3 -f, e = e 4 - f , / = e 5 -f, 
\a = ei + ff, d = e 2 -g, c = e 3 , e = e 4 , / = e 5 , 

and it, u; are free parameters satisfying 2 min{ei — p, e 4 , e 3 , es} > u; > and 
ei-0-f>it>O. 

To finish, assume (12|34) satisfies case 1.4. Then as before the same case 
holds for (12|35) and we obtain compatible values for all weights. In this 
case, a + f — a — f = 2g > 0, so (12 145) also satisfies case 1.4. Therefore 
T = D®D with 

ja = ei-g, d_=e 2 +g, c = e 3 , e = e 4 , / = e 5 , 

]a = ei + <7-f, J=e 2 - 5 + u + f, c = e 3 + f, e = e 4 + f, / = e 5 + f, 

and u, are free parameters satisfying > w > — 2min{e2 — g,e 4 ,e 3 ,e5} 
and > u > g — e 2 — ^. 

Next, we analyze cases 2.1 through 2.4 for quartet (12|34). By symmetry 
on the leaves 4 and 5, we know that cases 2.1 through 2.4 must also hold 
for quartet (12 135) . 

Assume case 2.1 for quartet (12|34). By looking at a = ei, we see that 
cases 2.2 or 2.4 cannot hold for (12|35). Thus we have either cases 2.1 or 
2.3. Suppose case 2.3 holds for (12 135) , we get e\ — | = a = e\ — y, so 
u' = u and from c we get e 3 — g — ¥■ = e 3 — g — ¥■ — w' so w' = 0. Thus: 

fa = ei-f, 4=e 2 -f, c = e 3 -g-%, e = e 4 + g + %, f_ = e 5 +g+%, 
]a = ei , d = e 2 , c = e 3 +g, e = e 4 - g + w, f = e 5 - g. 
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We have a + / = e\ + e$ — g = a + f — 2g, so we must switch the roles 
of D and D for quartet (12|45). By looking at the weight e for cases 2.1 
and 2.3 we see e = e$ + g + ^ = e§ — g + w with u" > 0, w < 0, g > 0, 
a contradiction. Likewise, for cases 2.4 and 2.2: e = e 4 + g = e 4 — g + w, 
w < 0, a contradiction. For cases 1.1 through 1.4 we use the value of /, /: 
/ = es or / = es compared to the known values / = es + g + | , f = e$ — g 
(u > 0, g > 0.) Note that these contradictions only involve the values of e 
and /, / that come from (12|34) and (12|35), and the fact that we had to 
switched D and D for the quartet (12|45). 

Therefore, case 2.1 holds for both quartets (12|34) and (12|35), and so 

fa = ei-f, 4=e 2 -f, c = e 3 -s-§, e = e 4 + g + %, / = e 5 + £+f, 
\a = ei, d = e 2 , c = e 3 + g, e = e 4 - g + w, f = e 5 - g + w' , 

with iy, to' < and u > 0. Now a + / = e\ + e^ — g + w' = a + / + (— 2g + w/) 
and —2g + w' < 0, so we must switch again D and Z) for the quartet (12|45). 
The same proof we gave using the values of edges e and / or / shows that 
none of the four possible cases can hold for (12|45). Thus, we conclude that 
case 2.1 does not hold for (12|34). 

The remaining cases will follow the same path of ideas, using the value of 
the edge e, c and the switching vs non-switching of D and D on the quartet 
(12|45). Suppose case 2.2 holds for quartet (12|34). From the weight a we 
see that cases 2.1 and 2.3 cannot hold for (12 135) , so we need to analyze 
cases 2.2 and 2.4. Assume case 2.4 holds for quartet (12 135) . Then from the 
value of a and c we see that u' = u, w = and so 

la = ei, d_=e 2 , c = e 3 -g, e = e 4 + g, f_ = e 5 +g, 

\a = ei + |, d = e 2 + |, c = e 3 + g - § , e = e 4 - g + |, / = e 5 - 5- + | + «/, 

with n, w/ < 0. We get d + f — a — f = —2g + u + i// < 0, so we need to 
switch the roles of D and I) in (12|45). 

Let us look at the value of e from (12|45) for each one of these cases. For 
case 2.1: e = e 4 + g = e 4 — g + w" with w" < 0; for case 2.2: e = e 4 + g = 
e 4 — g + Tj- with u" < 0; for case 2.3: e = e 4 + g = e 4 — 5, <7>0 and for case 
2.4: e = e 4 + <? = e 4 — g + ^- + w " with u", io" < 0. All four expressions 
lead to contradictions. Likewise, for cases 1.1 and 1.4: e = e 4 + ^ = e 4 + g 
(w" < 0) and for cases 1.2 and 1.3: e = e 4 = e 4 + g, which cannot occur. 

Therefore, case 2.2 must hold for (12|35). In particular / = + g and 
/ = es — g + t| (u = u' < 0). Thus, a + f — a — f = —2g + u < 0. Hence, we 
must switch .D and i) for (12|45). The same argument using the value of e 
at (12|45) leads to a contradiction. Hence case 2.2 cannot hold for (12|34). 

Assume case 2.3 holds for quartet ( 1 2 1 34) . By looking at edge a = e± we 
see that cases 2.2 or 2.4 cannot hold for (12 1 35) (because u' < 0), leaving us 
with cases 2.1 and 2.3 as possible candidates. Suppose case 2.1 for (12|35). 
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Then by looking at weights a and c we have u = u' and w = 0. Thus 

( a = ei - %,d = e 2 - %,c = e 3 - g - %,e = e 4 + g + %J_ = e 5 + g + 
\a = ei, d = e 2 , c = e 3 +g, e = e 4 -g, f = e 5 -g + w', 

with u > 0, w' < 0. Then a + f — a — f = —2g + w' < 0, so we must 
interchange D and D for the quartet. If cases 2.2 or 2.4 hold for (12|45) we 
get e = e 4 — g = e 4 + g, a contradiction. Likewise if cases 2.1 or 2.3 hold, we 
get e = e4 — g = e 4 + g + \ an d > 0, again a contradiction. Therefore, 
our only choice is case 2.3. Note that cases 2.1 and 2.3 share the same value 
for e = e 4 — g, so the only thing we need to know to ensure that case 2.3 
cannot hold for (12|35) is to guarantee that we must switch D and D and 
that cases 1.1 through 1.4 are invalid candidates for (12|45). 

Since / = e^+g+^ and / = eg — g (we know v! = it), we get a+ f — a— f = 
—2g, so D and D must switch roles in quartet (12|45). For cases 1.1 and 1.4 
we have e = e 4 = e 4 — g, whereas for cases 1.2 and 1.3: e = e 4 = e 4 + g + ^ 
with u > 0, a contradiction. Therefore, we conclude that case 2.3 cannot 
hold for (12 1 34). 

To finish, assume case 2.4 for (12|34). By looking at weight a we know 
that 2.1 and 2.3 are forbidden cases for (12 135) . Assume case 2.2 holds. By 
considering weights c, c we have u = v! < and w' = 0. Thus 

fa = ei, d_=e 2 , c = e 3 -g, e = e^ + g, f_ = e 5 +g, 

|a = e x + f ,d = e 2 + | ,c = e 3 +g - | ,e = e 4 - 5 + | = e 5 - 5 + | • 

Now a + f — a — f = — 2g + m < 0, so we must interchange Z) and 5 
for quartet (12|45). We compare the value of weight e for all four possible 
cases for quartet (12 145) . For cases 2.1 and 2.3 we get e = e 4 — g + ^ + 
w = e4 + g + tj- with u" > and u, w < 0, whereas for cases 2.2 and 2.4 
e = e 4 — g + ^ + W = + g, so we get a contradiction. Likewise, for cases 
1.1 and 1.4: 6 = 64 + ^ = + g (w" < 0) whereas for cases 1.2 and 1.3: 
e = e4 = e4 + g, also a contradiction. Therefore, case 2.4 must hold for 
(12|35). In particular, we know f = e$ + g, f = e$ — g + ^ + w" (u" = u) 
and so a + / — a — f = u + w" — 2g < 0, thus D and D must switch roles in 
quartet (12 145) . However, since we have the same value for e as before (it 
was computed from quartet (12(34)) the same proof we gave when assuming 
case 2.2 shows that case 2.4 is not allowed for (12|35). 

From the previous analysis we conclude: if ei,e 2 > g, then the fiber over 
T consists of cases 1.1 through 1.4 for all three quartets. Conversely, assume 
T is the mixture of D and D. If e\,e 2 > g does not hold, this would imply 
that cases 2.1 through 2.4 are the only candidates for (12|34). But from the 
construction this cannot happen. Therefore, we conclude ei,e 2 > g. □ 

Proof for Figure [5M We now change gears and we analyze the topology of T 
as in Figure I5bl We label the edges of T and of the star trees as in Figure [71 
We will show that no tree with this topology is a mixture of star trees. 
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Figure 7: Weights for edges in star tree and T for n = 5. 



Assume the contrary, and let T be such a point. Then by considering 
quartet (12|34) we know that e\, e2 > g + h or e^, > g + h. Without loss 
of generality we may assume e±, e<i > g + h. Now, consider quartet (12|53). 
Since e±, ei > g + h > g, we have to analyze all eight cases for (12 1 34) and 
(12|35). 

Suppose case 1.1 holds for (12(34), so d = ei + g + h. If cases 1.1 or 1.4 
hold for (12|35), we get e2 + g = d = e2 + g + h, because the middle edge 
weight for (12|35) is g. This contradicts h > 0. Likewise, for cases 2.1 and 

2.3 we get d = e2 + g + h = e2 — \ (u' > 0), whereas for cases 2.2 and 2.4: 
d = e2 + g + h = e2, a contradiction. Hence, cases 1.2 or 1.3 must hold for 
(12 1 35). For any of them we look at the weight c and we get c = e3 = e3 + i| 
which contradicts w < 0. Thus, case 1.1 is not allowed for (12|34). 

Next, suppose case 1.2 holds for (12|34). Then if we assume cases 1.1 or 

1.4 for quartet (12|35) we get d = e2+g = e^+g+h+^ with w > and h > 0, 
a contradiction. Likewise, for cases 1.2 and 1.3 we get a = e\ + g + h = e\+g 
which cannot happen. For cases 2.1 and 2.3: a = e\ + g + h = e±, whereas 
for cases 2.2 and 2.4: a = e± + g + h = e\ + ^ [u 1 < 0), which cannot occur. 

Now assume case 1.3 for (12|34) , so in particular a = e\ +g + h. Similarly 
to the procedure for case 1.2, if we assume cases 1.1 or 1.4 for (12|35) we 
get d = e\+g + h = e\+g — ^ where w' < 0, a contradiction. Likewise 
from cases 1.2 or 1.3 we see a = e\ + g + h = e± + g, which is also a 
contradiction. Since the value of o at (12|34) for cases 1.2 and 1.3 is the 
same, the arguments discussed in the previous paragraph discards cases 2.1 
through 2.4 for (12|35). 

To finish, suppose case 1.4 for (12 [34). As in the analysis of case 1.1, if 
cases 1.2 or 1.3 hold for (12 1 35) we have c = = + ^ , with w < 0, a 
contradiction. For cases 1.1 and 1.4, we look at d = e2 + g + h = e2 + g, 
so h = which cannot occur, whereas for cases 2.1 through 2.4 we use the 
same argument given in the analysis of case 1.1 for quartet (12|34). 

We now analyze the cases 2.1 through 2.4 for the quartet (12|34). By 
symmetry, (12|35) can only satisfy cases 2.1 through 2.4 as well. For cases 
2.1 and 2.3, we work with weight c = e% + g + h and get contradictions 
for all possible expressions of quartet (12 135) . For cases 2.2 and 2.4 we 
consider a = e± + | (u < 0) to rule out cases 2.1 and 2.3. From expression 
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o = ei + f = ei + Y (it coincides for both 2.2 and 2.4), we conclude u = u', 
and from weight c we get e^+g + h — | = e^+g — | , contradicting /i > 0. □ 

Remark 3.4. iVo£e i/iat in all four cases of the previous proof we found that 
the three quartets satisfy the same case, either 1.1, 1.2, 1.3 or 1.4 and the 
parameters involved on different quartets were the same. This observation 
will be essential to prove Theorem \2.8[ 

Corollary 3.5. Assume T £ im<f)Pi {non-star tree metrics}. Then the fiber 
of <j) over T is characterized as follows. If we fix the condition > D13 
(i.e. s > 0), there are four different and disjoint 2-dimensional polyhedral 
families giving the possible mixture decompositions of T = D © D. They 
correspond to cases 1.1 through I.4 in the proof of Lemma \2.6[ applied to 
each one of the quartets ofT. 

Proof of Theorem \2.8[ We need only prove item (ii) since (i) is contained in 
Lemma 12.61 Moreover, for n = 5 we can only have |/| = 2 and the result 
was proved in Lemma 12.71 Thus we need only analyze the case n > 6. 

From Lemma 12. 7\ it is clear that we have only one topolog y on n taxa 
that is likely to be in im cp n {tree metrics} in addition to the star topology. 
Namely, the one given in Figure O two star trees with labels in the disjoint 
sets I = . . . , i p } and J = . . . ,j q } with I U J = [n], glued together 
by one internal node of positive weight. Call this weight g. By symmetry 

Ji 

h 

jq-2 
jq-l 

h 

Figure 8: Non-star candidate T for a mixture of two star tree metrics on n 
taxa. 

on the taxa, we can assume 1,2 El and 3,4 E J. Moreover, assume 
= P < Q = \J\. We claim that we have only two cases to deal with: 
\I\ = 2 or |/| > 2. 

Restrict T to the quartets (12|34) and ( 1 2 1 3 j ) , with j > 4. Then, as we 
saw in the proof of Lemma 12.71 and stated in Remark 13.41 both quartets 
satisfy the same case (among cases 1.1 through 1.4) and ei,e2 > g. By 
symmetry, the same case will hold for all quartets (12\kl). 

Assume |/| = 2. By gluing the previous partial solutions together we get 
four families of expressions for T = D © D where -D13 > -D13. In particular, 
T € im</> n {non-star tree metrics} if and only if e\,C2 > g. 

On the contrary, assume |/| > 2. Therefore, the roles of / and J are 
symmetric. We conclude that > g for all i € [n] and that all quartets 
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(&Z|34) (resp. (12|fcZ)) satisfy the same case among cases 2.1 through 2.4 
(resp. cases 1.1 through 1.4.) The correspondence between cases mentioned 
in Remark ()3.2f) guarantees that we can construct D and D as in the case 
of five taxa. □ 



4. Tropical secant varieties and the space of star trees 

In this section we relate tropical secant varieties and mixtures of finitely 
many star trees on n taxa. For the basic definitions on tropical geometry we 
refer the reader to [12j . For applications to phylogenetics we recommend [11} 
Ch2-3]. 

It is a well-known fact that the space of star trees is the tropicalization 
of a toric variety. More precisely, of a torus translate of the image of the 
monomial map 

(4) V : C[tf, tt 1 } -> C[xf :l<i<j<n] ^(t) = = . 

Thus, the equations describing the torus translate by (oiij) i - G (C*)(a) of 
the image of ^ generate the ideal 

I = — ctijtitj : 1 < i < j < n) O C[x^ : 1 < i < j < n] = 

a ik aji Oi ik aji 

where we identify Xij with Xji and otji with ajj if j < i, and similarly for 
the other variables. 

From Theorem 12.41 we know that the linear equations characterizing the 
star tree topology are the tropicalization of the generators of the ideal I 
above with respect to the trivial valuation. Moreover since this ideal is 
binomial, the tropicalization of any set of binomials generating the ideal 
constitutes a tropical basis. 

Let L be the linear space spanned by the star tree metrics. From ex- 
pression Q we know that it is generated by the set of all vertices of the 
second hypersimplex A(2,n), i.e. the exponents of our monomial map ifi: 
{ei + ej : 1 < i < j < n}. As an example, if n = 4, this polytope is the 
octahedron and for n = 5 it is a 4-dimensional polytope with 10 vertices. In 
general, A(2, n) is an (n — l)-dimensional polytope in IR n with (2) vertices. 
In particular, we conclude that the dimension of L equals n. 

Although the space of star trees is the tropicalization of a toric variety, 
the space of positive weighted star trees (i.e. "star tree metrics") is not the 
tropicalization of any algebraic variety. More precisely, it is characterized by 
equations in tropical arithmetic, but they are not tropicalizations of Laurent 
polynomials, despite the fact that the Four Point Condition is the tropical- 
ization of the polynomials XijXki + xuXjk + xn-Xji. The metric condition 
causes for this misbehavior. However the space of star tree metrics is the 
intersection of the positive orthant with the tropical variety of tropical rank 
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one symmetric matrices (i.e. the space of trees.) In the language of [TT], it 
is the positive part of this tropical variety. 

As we mentioned in Section [H mixtures of star tree metrics are closely re- 
lated to tropical secants of (tropicalizations of) toric varieties ([3].) However, 
Tropical secants of cones are the appropriate setting to analyze mixtures of 
finitely many star tree metrics. Understanding this construction could help 
us extend the results obtained in previous sections. More precisely, following 
the notation of [3], the linear space L must be replaced by a rational poly- 
hedral cone, the rows in Ml are replaced by the extremal rays of this cone, 
and we need to switch from min to max convention. Among the results that 
can be extended to the cone setting, we have: 

Theorem 4.1. [3l Thm 2.1] Let L C W 11 be a polyhedral cone with d ex- 
tremal rays. Consider the associated matrix Ml £ M. dxm whose rows equal 
the previous d rays. Let Vl = {v±, ■ ■ ■ ,%} be the n-point configuration of 
M d given by the columns of Ml- Then a vector x = (x\, . . . ,x m ) is in the 
k-th tropical secant of L (with the max convention) if and only if the lower 
envelope of the polytope formed by the height vector x has k + l facets given 
by linear functionals with only positive coefficients whose union contains 
each point of Vl ■ 

The positivity condition for the coefficients comes from the positivity 
condition for scalars describing points in cones. For simplicity, we call such 
regular subdivisions positive regular subdivisions. Other than that, the proof 
follows line by line [31 Thm 2.1]. 

In our example, we consider the cone in of star tree metrics. Follow- 
ing the notation in [3], our matrix Ml £ 1/ IX ^) w i\\ have columns 

(5) M L lj) = a + ej e R n VI < i < j < n. 

Notice that this is the integer matrix associated to the monomial map t/j. The 
extremal rays of the cone are the cut metric corresponding to the partition 
{i} U ([n] \ {i}), as we vary i £ [n]. Notice that the cone is n-dimensional 
and it has n extremal rays. 

In unpublished work |2j , the authors study star tree ranks of tropical skew- 
symmetric n x n-matrices, that is, symmetric matrices where the diagonal is 
ignored. This notion of rank corresponds to computing the minimum r s.t. 
any symmetric matrix (modulo its diagonal) can be written as a mixture of r 
star trees, where we allow negative weights. More precisely, they show that 
for n > 3, any dissimilarity map on n taxa can be written as the tropical 
mixture of n — 2 star trees, possibly with negative weights on their edges. 
Moreover, this bound is tight. By comparing this result with Lemma 12. 6( 
we see that any metric on four taxa is a tropical mixture of two star trees, 
but not necessarily two star tree metrics. Therefore, star tree metric ranks 
are often higher than star tree ranks. We will show that, moreover, star tree 
metric ranks can be infinite (e.g. a cut metric on four taxa.) 
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Another interesting question to study emerges also from [3]. By [3j 
Thm 2.1, Cor 2.2] one knows that each regular subdivision in the k-th trop- 
ical secant variety of the space of star trees corresponds to a polyhedral 
cone of height vectors. In particular, this results in a decomposition of the 
tropical secant variety ^ as a polytopal complex. In the case of the k-th 
tropical secant variety of pointed cones we have: 

Corollary 4.2. [31 Cor 2.2] The k-th tropical secant variety of a pointed cone 
L is cone from L over a polytopal complex, which we call the k-th tropical 
secant complex of L. The faces correspond to positive regular subdivisions 
of Vl in which there exist (k + 1) facets containing all of the points, with a 
face F containing a face G if the regular subdivision associated to F refines 
the one associated to G. 

Consider the spaces of star trees, of star tree metrics, and their cor- 
responding tropical secant complexes. One knows that the complex cor- 
responding to star tree metrics ^+ is a strictly contained in the complex 
corresponding to star trees c £. In particular, it would be desirable to study 
how the cell structures of both complexes relate. In addition, one can inves- 
tigate how the cell structure of ^+ compares to the subdivision of ^+ into 
topological types. 

To finish, we would like to make some comments on star tree metric ranks 
of dissimilarity maps. Since the configuration of points associated to A(2, n) 
is in convex position, [3j Cor 2.3] shows that for k S> 0, the fc-th tropical 
secant (or oo-th tropical secant) of the space of star trees equals the ambient 
space r( 2 ). Notice that the results of [2] give us n — 2 as a sharp lower 
bound on such k. However, the space of tree metrics does not equal the oo- 
th tropical secant variety of star tree metrics. A very simple example is given 
for n = 4 taxa and the cut metric (12|34), i.e. the metric defined by D(l, 3) = 
L>(1,4) = D(2,3) = D(2,4) = 1 and otherwise. Assume D = 0f =1 S k 

(k) 

where S k are star tree metrics, and denote e - the weight of the edge pendant 
to leaf j in S k . Then we have S k (l,2) = e[ h) + e { 2 k) < -D(l,2) = and 
S k (3, 4) = + < D(3, 4) = 0. By the positivity condition on the edge 

(k) 

weights, we conclude e - = for all a contradiction. 

In general, we consider any cut metric Oil Tl tclXcl, given by a partition with 
at least two subsets are not singletons. Then, by conveniently restricting D 
to a metric on a four-element set we conclude that D does not lie in oo-th 
secant variety of the space of star tree metrics, so the latter is not the space 
of metrics. Moreover, the same example shows that we cannot cover all tree 
metrics by tropical mixtures of finitely many star tree metrics. In particular, 
these results together with [31 Cor 2.3] imply that regular subdivisions of 
the second hypersimplex need not be positive, as one would expect from the 
constructions. 

As we see from the previous discussion, mixtures of star trees and of star 
tree metrics have completely different behavior, although the underlying 
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combinatorics in both cases are closely related. Basic questions on the latter 
remain open. Among these are the characterization of fc-mixtures of star tree 
metrics, and membership tests for these sets. We believe that a serious study 
of tropical secants of cones will help us attack most of these open problems. 
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